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The "law of propagation of error" is a tool that physical scientists have conveniently and frequently 
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Introduction 

In the December 1939, issue of the American 
Physics Teacher, Raymond T. Birge wrote an ex- 
pository paper on "The Propagation of Errors." 
In the introductory paragraph of his paper, Birge 
remarked: 

"The question of what constitutes the most reliable value to 
be assigned as the uncertainty of any given measured quantity is 
one that has been discussed for many decades and, presumably, 
will continue to be discussed. It is a question that involves many 
considerations and by its very nature has no unique answer. The 
subject of the propagation of errors, on the contrary, is a purely 
mathematical matter, with very definite and easily ascertained 
conclusions. Although the general subject of the present article 
is by no means new, 1 many scientists still fail to avail themselves 
of the enlightening conclusions that may often thus be reached, 
while others frequently use the theory incorrectly and thus arrive 
at quite misleading conclusions."' 

Birge's remark 27 years ago still sounds fitting today. 
For a number of years, the need for an expository 
paper on this topic has been felt by the staff of the 
Statistical Engineering Laboratory at the National 
Bureau of Standards. Frequent inquiries have to 
be answered, yet a diligent search in current litera- 
ture and textbooks failed to produce a suitable ref- 
erence that treats the subject matter adequately. 
The present manuscript was written to fill this need. 

In section 1, we consider the two distinct situations 
under which the propagation of error formulas can 
be used. The mathematical manipulations are the 
same, yet the interpretations of the results are en- 
tirely different. In section 2 the notations are de- 
fined and the general formulas given. Frequently 
used special formulas are listed at the end of the 
section for convenient reference. In section 3 the 
accuracies of the approximations are discussed, 
together with suggestions on the use of the errors 
propagated. Section 4 contains suggestions on the 
reporting of final results. 



1 See, for instance, M. Merriman, Method of Least Squares, pp. 75-79 (ed. 8, 1910). 



The "law of propagation of error" is a tool that 
physical scientists have conveniently and frequently 
used in their work for many years. No claim is 
made here that it is the only tool or even a suitable 
tool for all occasions. "Data analysis" is an ever- 
expanding field and other methods, existing or new, 
are probably available for the analysis and inter- 
pretation for each particular set of data. Never- 
theless, under certain assumptions given in detail 
in the following sections, the approximations resulting 
from the use of these formulas are useful in giving 
an estimate of the uncertainty of a reported value. 
The uncertainty computed from the use of these 
formulas, however, is probably somewhat less that 
the actual in the sense that no function form is known 
exactly and the number of variables considered usually 
does not represent fully the contributors of errors 
that affect the final result. 

1 . Statistical Tolerancing Versus Imprecision 
of a Derived Quantity 

1.1. Propagation of error formulas are frequently 
used by engineers in the type of problem called "Sta- 
tistical tolerancing." In such problems, we are 
concerned with the behavior of the characteristic 
W of a system as related to the behavior of a charac- 
teristic X of its component. For instance, an engi- 
neer may have designed a circuit. A property W 
of the circuit may be related to the value X of the 
resistance used. As the value of X is changed, 
W changes and the relationship can be expressed 
by a mathematical function 

W = F(X) 

within a certain range of the values of X. 

Suppose our engineer decides on W = Wq to be the 
desired property of the circuit, and specifies X = Xo 
for this purpose. He realizes, however, that there 
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will be variations among the large lot of resistors he 
ordered, no matter how tight his specifications are. 
Let x denote the value of any one of the resistors 
in the lot, then some of the time x will be below 
xo, while at other times x will be above x . In other 
words, x has a distribution of values somewhat clus- 
tered about xq. As x varies with each resistor, so 
does w with each circuit manufactured. 

If our engineer knows the mean and standard 
deviation (or variance) of x, based on data from the 
history of their manufacture, then he can calculate 
the approximate mean and variance of w by the 
propagation of error formulas: 



mean (w) — F(mean x), and 



variance (w) — 



dF 



dX 



var (#), 



(i.i) 



where the square brackets signify that the derivatives 
within the brackets are to be evaluated at the mean 
of x. The approximations computed refer to the mean 
and variance of an individual unit in the collection of 
circuits that will be manufactured from the lot of 
resistors. The distribution of values of w, however, 
is still far from being determined since it depends 
entirely on the functional form of the relation between 
W and X, as mathematical variables, and the distri- 
bution of x itself, as a random variable. This type of 
approach has been used frequently in preliminary 
examinations of the reliability of performance of a 
system, where X may be considered as a multidimen- 
sional variable. 

1.2 Let us consider now the second situation under 
which propagation of error formulas are used. This 
situation is the one considered in Birge's paper, and 
is the one that will be discussed in the main part of 
this paper. 

A physicist may wish to determine the "true" value 
Wo of interest, for example, the atomic weight of silver. 
He makes n independent measurements on some re- 
lated quantity x and calculates 



1 



%n ' 



(jti-hx 2 + . 



+ X n ) 



as an estimate of the true value xq and 



— ^ (Xi~ Xnf 



as an index of dispersion of his measured values. The 
physicist is mainly concerned in obtaining an estimate 
w of w , and of the standard deviation of w as a measure 
of precision of his result. He therefore computes by 
the propagation of error formulas: 



var (w) ~- 



dF 



dX 



n 



(1.2) 



o~w = Vvar (w) 



Often he assumes that w is distributed at least approxi- 
mately in accordance with the normal law of error and 
gives probability limits to the statistical uncertainty of 
his estimate w based on the standard deviation calcu- 
lated (or w ) and this assumption. 

Cramer [1946] has shown that under very general 
conditions, functions of sample moments are asymp- 
totically normal, with mean and variance given by 
the respective propagation of error formulas. 2 Since 
x n is the first sample moment, the estimate w will be 
approximately normally distributed for large n. Hence 
our physicist is interested in the variance (or the 
standard deviation) of the normal distribution which 
the distribution of F(x n ) approximates as n increases. 
(Note that both estimators w and var (w) are functions 
of n.) For n large, the distribution of w can be as- 
sumed to be approximately normal and probability 
statements can be made about w. 
1.3 Hence, we have the two cases: 

(1) The problem of determining the mean and vari- 
ance (or standard deviation) of the actual distribu- 
tion of a given function F(x) of a particular random 
variable x, and 

(2) The problem of estimating the mean and vari- 
ance (or standard deviation) of the normal distri- 
bution to which the distribution of F(x n ) tends asymp- 
totically. 

As examples of problems studied under the first 
case, we can cite Fieller [1932] on the ratio of two 
normally distributed random variables, and Craig 
[1937] and Goodman [1962] on the product of two 
or more random variables. Tukey, in three Princeton 
University reports, extended the classical formulas 
through the fourth order terms for the mean and 
variance, and propagated the skewness and elon- 
gation of the distribution of F(x) as well. These 
reports present perhaps the most exhaustive treat- 
ment of statistical tolerancing to date. 

From now on we shall be concerned in this paper 
with the second case only, i.e., the problem of esti- 
mating the mean and variance, or standard deviation, 
of the normal distribution to which the distribution of 
F(x n ) tends as n increases indefinitely, and hence 
also the problem of using approximations to the 
mean and variance computed from a finite number of 
measurements. Since the mean and standard 
deviation are the parameters that specify a particular 
normal distribution, our problem is by its very nature 
less complicated than that of statistical tolerancing 
where the actual distribution of the function may 
have to be specified. We shall, however, utilize 
formulas given in Tukey's reports to check on the 
adequacy of some of the approximations. 



--F(x n ) 



2 A brief summary is given in paragraph 2.2. 
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2. Propagation of Error Formulas 

2.1. Definitions and Notations 

(1) X, Y, Z in capitals stand for the mathematical 
variables to be measured; x, y, z in lower cases stand 
for the measured values of these variables; Xi 9 yj, Zk 
with subscripts stand for the particular values of the 
ith measurement on x 9 the 7th on y, and the A:th on z, 
respectively. 

(2) W=f(X, F, Z) is a continuous function of the 
variables X, F, Z, with derivatives 



dW 3W 



dX ' BXdT 



etc. 



(3) All derivatives appearing in square brackets, for 



example 



raw 


1 


raw\ 



, stand for the values of these de- 



rivatives evaluated at the means of x and y, if known, 
or at the sample averages of x and y, if the means are 
not known. 

(4) In order'to emphasize the fact that the mean M, 
variance cr 2 and other population parameters are usu- 
ally not known, we list here symbols for both the 
estimators of population values and the population 
values. For a particular set of values of x, the values 
computed from these estimators are estimates, or 
computed values of these estimators. 



Estimators of parameters 


Corresponding population 
parameters 




1 " 




M x (mean = first moment) 


s 2 = 


1 " 
(=1 




(T x (variance = second 

central moment) 




n — 1 I n } 




Sxy 


1 V l 

" 1-1 


-x)( yi -y) 


(Txu — cr,,x (covariance) 




= llxiyi- 

n—l I 


(S*0(Sn)l 

n J 






S.vS,j 


* 


Pxy (correlation coefficient) 




Sx 




(T x (standard deviation of x 
about M x ) 




1 

Sx = —=. S x 




(T 1 1 1 1 • • 

0"jr = — — (standard deviation 
V/j of the average 3c, 
or standard error) 




Sx 

Vx= — 

X 




— (coemcient ol variation or 
Mx relative standard 
deviation) 



In addition, we use \kx\ to denote the bound for pos- 
sible systematic errors on the measurements of x. 
The bound of these errors, unknown in sign, is usually 
established or conjectured by the experimenter and 
its value is not based on the measurements in hand. 

2.2. General Theorem and Remarks 

As mentioned briefly in paragraph 1.2, the propaga- 
tion of error formulas are special applications of re- 
sults obtained in the study of properties of distributions 
of functions of sample moments. Doob [1935], Hsu 
[1949], and others have investigated the limiting 
distribution of functions of sample means relating to 
hypothesis testing. Curtiss [1943] derived the limit- 
ing means and variances of the several functions of 
variables in connection with transformations used in 
the analysis of variance. Cramer, in chapters 27 
and 28 of his classical treatise, proved two theorems 
and also discussed the asymptotic properties of distri- 
butions of functions of sample moments in detail. 
For convenient reference we shall phrase his theorems 
and remarks in terms of functions of sample averages, 
to serve as a basis of justification for the use of propa- 
gation of error formulas. 
Theorem (Cramer, pp. 366, 352-356) 

//, in some neighborhood of the point X = M x , Y = M y , 
the function F(X, Y) is continuous and has continuous 
derivatives of the first and second order with respect to 
the arguments X and Y, the random variable w = F(x, y) 
is asymptotically normal, the mean and variance of 
the limiting normal distribution being given by: 



mean w = F(M x , M y ) 



var w = 





2 O" 2 

n 


raF] 



-+2 



pF 

ax- 


P F 1 



(2.1) 



(2.2) 



Remark 1. (Cramer, p. 367) 

It follows from this theorem that any function of 
sample averages is, for large values of n, approxi- 
mately normally distributed about the value of the 
function determined by the mean values of the basic 
variables, with a variance of the form C/n, provided 
only that expressions (2.1) and (2.2) yield finite values 
for the mean and the variance of the limiting 
distribution. 
REMARK 2. (Cramer, pp. 367, 415, also Doob, Hsu) 

In general, the constant C in the expression of the 
variance will have a positive value. However, in 
exceptional cases C may be zero, which implies that 
the variance is of a smaller order than n~ l . Then 
some expression of the form 

nP{vb-F(M X9 M y )} 9 p>i, 

may have a definite limiting distribution, but this is 

not necessarily normal. 

Remark 3. (Cramer, pp. 366, 213-214) 

The function F(x, y) may be asymptotically normal 
even though the mean and variance of F{1, y) do not 
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exist, or do not tend to the mean and variance of the 
limiting normal form. Generally, if the distribution 
of a random variable w depends on a parameter n, 
and if two quantities M and <x can be found such that 



the distribution function of the variable 



w- 



M 



cr 



tends 



to $>(t) (normal distribution function with mean zero 
and standard deviation one) as rc— »°°, we shall say that 
w is asymptotically normal (M, cr). This does not 
imply that the mean and the standard deviation of 
w tend to M and cr, nor even that these moments 
exist, but is simply equivalent to saying that for any 
interval (a, b) not depending on n, 

lim Prob. (M + aa < w < M + ba) = 4>(6) - 4>(a). 

n— > °° 



mean €? = 
mean Xi = M x 
var x = var e* 

= mean e? = cr|. 

And for condition 2, we assume 

A< The errors in the measurements of x% (i=l, 2, 
. . . , ft) are statistically independent; in particular 
these errors are not correlated or associated in any 
way, i.e., 



mean (e, • €j) = 0, 



i*h 



1 



Thus for x — - (%\ + x% + 
n 

Furthermore, 



+ x n ), the mean of x is M x . 



EXAMPLE: If x is from a continuous distribution with 
positive mean and a finite variance but with positive 
probability that some x can take negative values, then 
the function In x is not even defined for all values of x, 
and therefore the mean of the function In x does not 
exist; yet where the mean of x has a positive value, 
(2.1) and (2.2) give the mean and variance of the lim- 
iting normal distribution. 

2.3. Propagation of Error Formulas 

Fortified with the general theorem stated in the 
preceding paragraph, we shall proceed to derive the 
traditional propagation of error formulas in an ele- 
mentary manner, making some comments and as- 
sumptions that may be of interest. It will be helpful, 
however, to explain first what is meant here by the 
term ''random error" in a measurement process. 

a. Random Errors 

In a measurement situation, we consider random 
errors typically to be the sum total of all the small 
negligible independent errors over which we have no 
control — interpolation in reading scales, slight fluctu- 
ation in environmental conditions, imperfection and 
nonconstancy of our senses, etc. Thus for a stable 
measurement process, we find that: 

(1) The measured values do follow a distribution, 
with small errors occurring more frequently than larger 
ones, and with positive and negative errors about 
balancing one another, and 

(2) there is no obvious trend or pattern in the se- 
quence of measurements. 

Let us denote the ith measurement of x to be 

where M x is the mean of all measurements for the 
measurement process, and €* the random error of 
measurement xu Then for condition 1, we assume 
A\\ The distribution of errors is symmetrical and 
bell-shaped, with mean zero and standard deviation 
cr>, or 



-M x = 



6J+62+. . 



€„ 



n 

By definition, the variance of x is 

.+e 2 + . 
mean (x-MJ -mean 



€„ 



n 



n 2 { 



mean 



+ mean 



2 €i€j 



— £ \ n mean (e,) 2 + ^ mean (€*€,•) 



Using assumptions A x and A 2 , we obtain 



_ 1 
var (x)=— arz 
n 



or the variance of the average of n independent meas- 
urements is - of the variance of an individual meas- 
n 

urement. 3 

Here the average x is a linear function of the in- 
dividual %'s, and the exact expressions of mean and 
variance of an average in terms of that of the individual 
values are well known. For functions that are not 
linear in the x's, we expand the function about the 
mean of x by the Taylor series, and assume that the 
function in the neighborhood of the mean can be ap- 
proximated by the lower order terms. For example, 
let 

w=f& 9 n 

x = M x + e x , 



3 If, however, the measurements are not independent, then this formula is incorrect since 
the means of products («*€/) are not equal to zero. In that case let 



ean (e,ej)=pijcr t ji , and p=2 P>il n ^ n ~ 1)> tnen var W s 



: {1-Mn-Dp}. 
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where each of e x and e y satisfies assumptions A\ and 
A2, then we can write 



F{x, y)=F(M x , M y ) + 

\d 2 F 



§F 

8X 



4 



dX 2 



e| + 2 



d 2 F 



dF 

dY 



€ x e y + 



d 2 F 



aP 



4] 



_dXdY] 

-\- terms of higher orders in e x and e y . (2.3) 
Or, neglecting terms of higher order than e| and e|, 

F<?,y)-F<Mx,MJ 



4 



Since the means of e. r and e (/ are 0, if we take the mean 
on both sides, 



%) = 


~dF~ 
.dX. 


ex 


+ 


dF 

.dY_ 


e» 




~d 2 F' 
_dX 2 _ 


4+ 


2 


ra 2 / 


(dY. 


6x6» + 


Fa 2 F~ 

.dY 2 . 



mean{F(x,y)-F(M x , M y )} 

"2 



{ 


~d 2 F~ 

_dX 2 _ 


a 2 +2 


r a 2 F " 


0>.v + 


ra 2 F] 
.ap_ 



r? • (2.4) 



Thus the mean of a function of values always differs 
from the value of a function of means by a quantity rep- 
resented by (2.4), approximately. If the function of 
means F(M X , My) is the value of interest, then to 
approximate F(M X , My) by the mean of F(x, y) would 
introduce an error, or bias, the magnitude of which 
depends on the functional form, the variances of and 
the covariance between x and y. If, however, we 
use the function of averages, F(x, y), then 



mean w= mean F(x, y) — F{M X , My) 



+ 



\{ 



\ff 2 Fl 


^+2 
n 


' d 2 F] 
_3XdY_ 


<** + 

n 


\d 2 Fl 
_dY 2 _ 



(2.5) 



and the bias is only l/n times that of the mean of the 
function of individual values. When n becomes large, 
this bias tends to zero, and (2.1) results. 

This bias can be calculated by (2.5) and compared to 
the standard deviation of w. In practice, if cr x and a y 
are small, the bias is often of a magnitude that is 
negligible. 

To propagate the variance, we note t hat if e x and e y 
are small in the sense that the second and higher order 
terms in (2.3) can be collectively neglected in compari- 
son to terms involving e x and e u only, then 



and the variance of F(x, y) is, approximately 
mean{F(x, y)-F(M x , M /7 )} 2 =mean 



dF 

dX 



e x + 



dF 

dY 



raF] 


2 


raF] 


2 


raFi 


raFi 


_dX_ 


0*+ 


_dY_ 


o- 2 + 2 


_dX_ 


BY. 



<r xy . (2.6) 



And for tb = F(x, y), the variance of tb is 

raF 



var(#>) — 



§F 

dX 



2 ^ + 

n 



dF 
dY 



'+2 



dF 

dX 



dY 



^ (2.7) 
n 



the limiting form of which is (2.2). 

Finally, if <xj, crjj, and cr X y are not known, we substi- 
tute their estimators in formulas (2.5) and (2.7), 
resulting in: 



mefen(fl?) = F(M X , M u ) + : 



d 2 F 



dX 2 



+ 



d 2 F 



dY 2 



^+2 



d 2 F 



dXdY 



^ , (2.8) 
n 



and 



var(fl<) 



[dX] n 



+ 



dF 

dY 



£2. 

n 



dF 

dX 



dF 

dY 



Sxy 

n 



(2.9) 



If we assume further that the random errors in meas- 
urements of x and y are independent, then a xy = 0, and 
the terms involving a xu in (2.5), (2.6), and (2.7) vanishes. 
If this is the case, the terms involving s X! , in (2.8) and 
(2.9) should also be dropped. This reduced version 
of the formula for independent x and y, 



var(fl;) = 



dF 

dX 



2 <rl , 


raF] 


— + 




n 


[dY] 



tl ' 



(2.10) 



is of the form given in Birge's paper and in other text- 
books on statistical analysis of data [Mandel, 1964, 
pp. 72-76]. 

For W = F(X, Y, Z), there will be three variance and 
three covariance terms in (2.5) and (2.7). Extension 
to more than three variables presents no new problems. 



b. Extension to More Than One Function of the Variables 

Let 



F(x,y)-F(M x ,M y ) = 


dF' 
dX. 


€.,- + 


raF] 

dY. 


ey, 


and 


U=g{X,Y,Z), 
V=h(X, Y,Z). 
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Then in addition to the above formulas, we have 



(Tuv = 



dU dV 
dX'dX 

+ 
+ 
+ 



o-t 



dU dV 
dY'dY 



cr* + 



(t: 



{ 


~dU dV~ 

_dX'dY_ 


+ 


~dU dV 
_dY'dX_ 


I 


~d(J bV 

_dY'dZ_ 


+ 


~dU dV 



'dU dV 

dZ'dX 



+ 



dU dV 

dX'dZ 



dU dV 

dZ'dZ, 

PxyCTxCTy 

P)jzCTy<T Z 



pzxtTzCTx 



(2.11) 



Expression (2.11) may be convenient to use to get 
(r(w) where W = F(U, V), and U and V are known func- 
tions of X. Y. and Z. 



When there are a number of systematic errors to be 
propagated, one approach is to take | Aw | as the square 
root of the sum of squares of terms on the right-hand 
side of (2.12), instead of adding together the absolute 
values of all the terms. This procedure presupposes 
that some of the systematic errors may be positive and 
the others negative, and the two classes cancel each 
other to a certain extent. 

The treatment of inaccuracy due to systematic errors 
of assignable origin but of unknown magnitudes is 
discussed in detail in section 4.2 of Eisenhart [1963]. 
Since there is no generally accepted standard method 
for combining several systematic errors, Eisenhart 
advised and we quote 

"Therefore, anyone who uses one of these methods for the 'com- 
bination of errors' should indicate explicitly which of these (or an 
alternative method) he has used.'"' 

Information on the source and magnitude of each con- 
tributing elemental systematic error is, of course, also 
essential. 



c. Some Frequently Used Formulas 

For convenience, a few special formulas for com- 
monly encountered functions are listed in table 1 with 
x, y assumed to be independent. These may be 
derived from the above formulas. 



2.4. Systematic Errors 

By a systematic error we mean a fixed deviation that 
is inherent in each and every measurement of x in a 
particular sequence of measurements. If the magni- 
tude and direction of the systematic error are known, a 
correction can be made such that M x = Xq, or the mean 
of the sequence of measurements is equal to the value 
sought after. If the sign of the systematic error is not 
known and the magnitude of the error can be only 
estimated to be within some reasonable bound |A#|, 
perhaps by experience or judgment, then Wl x is within 
the limits Xo — Ax and x {) -f Ax. 

For a function of two variables W=F(X, Y) then, a 
bound | Aw | for the systematic error in W is given by: 



I Am;! = 



3. Practical Accuracies at the Various Stages 
of Approximations 



[dX_ 


Ax 


+ 


_dY_ 


Ay 



assuming, as before, that Ax and Ay are small such that 
second and higher order terms in Ax and Ay are collec- 
tively negligible in the Taylor series expansion. Since 
ordinarily we do not know the signs of Ax and Ay, we 
have no choice but to add the absolute values of the 
two terms together, even though the signs of the values 
of the partial derivatives evaluated are known. (If the 
signs of either Ax or Ay is known, this information, of 
course, should not be ignored.) If these derivatives 
are evaluated at the point x and y, then the random 
components of error of x and y are required to be small 
so that these derivatives take approximately the same 
values as when evaluated at x () and y<>. 



3.1. From the preceding sections we observe that 
there are three stages of approximations: 

(1) In the Taylor series expansion (2.3), terms higher 
than the first partial derivatives are considered to be 
negligible. 

(2) w is approximately normally distributed for large 
n. Is the normal distribution still a good approxima- 
tion for small n? 

(3) If crj and cr\ are known, we obtain crfr 1 from (2.7), 
and we can use this value to construct a confidence 
interval 3a about w with the desired level of confidence 
(approximate) based on normal theory. If orf. and <r 2 
are not known, and s| and s* are calculated from a small 
number of measurements, what can we say about 
w using \8ly(w) calculated from (2.9)? 

To get some numerical feeling for the closeness of 
these approximations, we shall simplify matters by 
making the following assumptions which do not seem 
to be too restrictive in measurement situations: 
B\\ x and y are normally and independently distrib- 
uted, with the ratio M/cr not less than 10. 4 
(2.12) B< The functional forms used are the well-behaved 
ones that do not possess derivatives assuming 
unreasonably large values when evaluated at the 
averages of the individual variables. 
Thus for linear functions, such as 



W = AX + BY, 

the second and higher derivatives vanish, and (2.6) is 
exact. 

The adequacy of these approximations is studied in 
paragraphs 3.2 and 3.3 below. In paragraph 3.4 sug- 



3a See Natrella [1963]. sec. 1 to 7. also chs. 2 and 3. 

4 For notational convenience, the symbols w. x, y. cr x , <r y , etc., are used in this and the 
subsequent sections. The corresponding symbols for the average could be used by straight 
substitution. 
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Table 1. Propagation of error formulas for some simple junctions 
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*' It is assumed that the value of w is finite and real, e.g., y ¥- for ratios with v as denominator, x > for V.v and In x. 
**' Weighted mean as a special case of Ax + By, with cr. r and cr y considered known. 

t Distribution of w is highly skewed and normal approximation could be seriously in error for small n. 
tt See, for example. Statistical Theory with Engineering Applications, p. 301, by A. Hald (John Wiley & Sons. New York, N.Y.. 1952). 
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gestions are made on the use of the standard deviation 
calculated for w when the standard deviations of x and 
y are not known. Readers may wish to go directly to 
paragraph 3.5 for a summary of the conclusions. 

3.2. For x, y independently distributed and arbi- 
trary F(x, y), the first correction terms to (2.6) are 



~dF~ 

.dX_ 


[d 2 F~ 
_dX\ 


y*oi+ 


dF' 
.dY. 


~d 2 F~ 
_dY 2 _ 



Jitfr 



(3.1) 



where y is a measure of skewness of the distribution. 5 
Therefore these terms equal zero for x, y symmetri- 
cally distributed, a condition satisfied by assump- 
tion B\. 

The next order of correction terms involve crj, a 4 
and cr|crj5 and are usually negligible compared to terms 
in (2.6). These terms are 



dF 

dX 



a 3 F 



dX 3 



T x ai+ 



dF 

dY 



d 3 F 



dY 3 



r .v°1 



+ • 



+ 4 

dF 

dX 



d 2 F 



dX 2 



(T x -l)(4+ 



d 2 F 



d 3 F 



dXdY 2 



-f 



d 2 F 



dXdY 



dY 2 



+ 



(r y -i)oi 



d 3 F 



dX 2 dY 



dF 



dY 



J X KJy. 

(3.2) 



For functions involving powers of x and y less than 
three, some of the partial derivatives also vanish. 
For example, if W = XY, the only nonzero term of 
this order is cr|o"|, or 

Var (w) = M 2 a% + M\o% + crlcr 2 . 

The contribution of o-|cr| is less than 1 in 200 if Mjcr 
is larger than ten. 

For functional forms such as quotients, roots, and 
logarithms, the accuracy is usually adequate since 
powers of the means of the variables appear in the 
denominators of the partial derivatives. 

For the exponential function W = e x , the variance 
of w as given by (2.6) is 

Var(w)=e 2A/ o- 2 , 

whereas the exact formula 6 for the variance of w, 
when x is normally distributed, is 

VarW = e^ 2M (e^ 2 -l) 



o- 2 -f-^- + 



o-° 



2! 3! 



Here the variance of w as given by (2.6) underestimates 
the true variance by the factor given in the brackets, 
and the approximation could be seriously in error. 
(Note, however, the "exact" formula is correct only 
if x is exactly normally distributed. If x is only ap- 
proximately normally distributed, then both for- 
mulas are approximations.) 

For specific functions, formulas (3.1) and (3.2) given 
in Tukey's report can be used to check on the ade- 
quacy of the approximation. We quote Tukey's con- 
clusion in this respect: 

"The most important conclusion is that the classical propagation 
formula is much better than seems to be usually realized. Examples 
indicate that it is quite likely to suffice for most work." 

3.3 Next we look into the adequacy of the normal 
approximation. For this purpose we will define the 
first four central moments of the distribution of w as 
follows: 



mean (w — M w ) = 
mean (w — M w ) 2 = cr 2 
mean (w — M u ) 3 = ycr 3 
mean (w-M w ) 4 = Vcr 4 . 



(3.3) 



If w is normally distributed, y = 0, and T — 3. Fol- 
lowing Tukey, we shall define 

skewness = ycr 1 , and 
elongation = Fa 4 — 3CT 4 ; 

then both skewness and elongation are equal to zero 
when w is normally distributed. 

If x and y are normally distributed as assumed under 
Z?i, then in general w = F{x, y) is not normally dis- 
tributed unless the function form is linear. By a 
procedure similar to that used in the last section, the 
coefficients of skewness /3i and excess f3 2 of w can be 
calculated where: 



0i = 



[skewness w] 2 
[var w] 3 



/* 



elongation w 
[var w~\ 2 



3. 



If /3i is close to zero and y8 2 is close to 3, the normal 
approximation may be considered as adequate. 

The terms up to order cr 4 in the propagation of 
skewness for w = F(x, y), with x, y independent, are 
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"'For definition of y and I see (3.3). 

6 See, for example, The Lognormal Distribution, p. 8, by J. Aitchison and J. A. C Brown, 
Cambridge University Press, 1957. 
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(3.4) 
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For x, y normally distributed, only terms of order 
o -4 remain. If we take w = xy again as an example, 
then 

skewness w — 6M x M y a 2 x cr 2 y 

Q 36M 2 x M 2 ai<T$ 



[MIo%+M%o% + (t%(tIY 



Of course one can always compute the half-widths 
of the respective 100 (1 —a) percent confidence inter- 
vals for M x and for M u by the use of the Student's / 
statistic, and use (2.12) to get the half-width of the 
interval for M w , i.e., set 



Ax= '('-f)-»-'^ and - Ay= 'o- 



VF 



Neglecting a%(rl in the brackets in the denominator, 
and taking M/cr=l0, /3i is computed to be 0.045. 
Hence, for ftj = x y, where x and y are averages of four, 
the coefficient of skewness is reduced by a factor of 
four or equals 0.011 approximately. 

Similarly, terms up to order a 4 for the elongation 
of w=f(x, y), with x, y independent, are 



elongation 



dF 
dX 



(r x -3)oJ + 



dF 
dY 



OV 



-3)o-< 
(3.5) 



which is zero for x, y normal, 
elongation w 



Hence /3 2 = 



2 + 3 = 3, and no correction 



(Variance w) 
for elongation is necessary here. 

If we look up a table 7 of percentage points of dis- 



tribution of the standardized variate 



w -M lL 



with given 



and use (2.12) to get Aw. Then the interval w±Aw 
is a confidence interval for M w for a confidence co- 
efficient of at least (1 —a). This procedure, however, 
may be criticized on the ground of gross inefficiency 
in using the data. 

We may write (2.9) as 



/3\ and /3-j, we note that the changes of values are rather 
sensitive to /3\ and much less so to /$>• Thus the co- 
efficient of elongation is usually not as much a source 
of worry in the normal approximation as is the coeffi- 
cient of skewness. 

Formulas (3.4) and (3.5) and the table of percentage 
points allow us to check how good the normal approxi- 
mation is for a given number of measurements in the 
variables x and y. Table 2 gives some examples of 
results of such calculations. 

3.4 The third approximation concerns the use of the 
sample variance s 2 as an estimate of the population 
variance & 1 . If we know the precision of the proc- 
esses for the measurements of x and y, i.e., we know 
cr x and cry, <r lv can be computed from (2.7) and a con- 
fidence interval about w can be constructed with the de- 
sired confidence coefficient 1 — a by using the table of 
the normal probability integral. If a x and <jy are not 
known, then even if &ft can be computed from (2.9), 
the constants to be used for constructing a confidence 
interval with confidence coefficient 1 — a will be dif- 
ferent from those for known o~. 

To offer some guideline to the solution of this prob- 
lem, we again assume measurements on x and y to 
be independently and normally distributed. If the 
number of measurements is large (a rule of thumb 
could be n > 30), then (2.7) can be used assuming 
cr|, cr|, and cr xu are known. 



7 See Table 42. Biometrika Tables for Statisticians. Vol. 1. edited l>\ 
H. O. Hartley, The University Press, 1958. Also. pp. 79-84. 
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n— 1, and s y , 



, values of a 4 V 



For given degrees of freedom for s 

/, — 1. and given ratios of - — ;— 7 - x 

ktS x + \.2Sy 

statistic have been tabulated 8 for confidence coef- 
ficients of 0.99, 0.98, 0.95, and 0.90. The interval 



w±v VXisf+X^s.? 



(3.6) 



is a confidence interval with confidence coefficient 
\-a. 

These tables, however, do not contain values for 
'V for /? and A less than 10, 10, 8, and 6 for the re- 
spective confidence coefficients, and hence cannot 
be used for smaller samples. In addition, they are 
useful only for two independent variables x and y. 

Alternatively Welch [1947] has proposed the use 
of "effective degrees of freedom" for the estimated 
variance of w of the form 

vZr{w) = Xkisl 

The effective degree of freedom / is computed from 

(XkiSj) 2 



f~- 



^sflfi) 



(3.7) 



where/) is the degrees of freedom for 9 sf. 

In general / will be fractional. The t value with / 
degrees of freedom can be found or interpolated from 
the t table and the confidence interval computed as 

w±t, aX &$. 



B See Table 11, Biometrika Tables for Statisticians, Vol. 1; also Further critical values 
for the two-means problem, W. H. Trickett. B. L. Welch, and G. S. James. Biometrika 43, 
i and 195b. pp. 204-5. 

Mf .sf is computed from m measurements, the degrees of freedom is n\— 1. 
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Table 2. Departures from normal approximations 

x, y independently distributed, with y = 0, f = 3, and (A//tr)= 10. 
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t Exact when x and y are normally distributed. 



The approximate confidence intervals computed by 
the use of effective degrees of freedom were found to 
check the exact confidence intervals given by (3.6) 
very well over the range of the latter. 

3.5 In summary, the following may be concluded 
for practical purposes: 

(1) Terms of order higher than o~ 2 in the propagation 
of error formulas for variance, (2.6) and (2.7), can be 
neglected if (a) the standard deviations are small in 
comparison to their respective means, and (b) the 
second and higher order partial derivatives evaluated 
at the means do not give rise to abnormally large num- 
bers. This is usually true in the field of physical 
science, since errors of measurements are usually 
of the order of 1 part in 1000, or parts per million; 
furthermore, the functional forms used are usually 
the well-behaved ones. 

(2) The normal approximation will be adequate for 
large n, or if, in addition to (a) and (b) above, (c) the 
individual variables can be assumed to be normally 



distributed. For particular functions, the approxi- 
mate values of the coefficients of skewness and elon- 
gation may be calculated and Pearson's table can be 
used to check the adequacy of the approximation. 

(3) For the case where the standard deviations of 
the individual variables are unknown, and are esti- 
mated from the data, confidence intervals for the 
estimate fo can be constructed either by the use of 
tabulated values of the 6 V statistic or by the use of 
effective degrees of freedom. These confidence 
intervals can be considered as a form of "precision 
limits" in the sense that if one makes the same sets 
of measurements a large number of times under the 
same conditions, and constructs the confidence in- 
tervals each time by the same procedure, then a large 
proportion of the intervals so constructed, 100 (1 — a) 
percent, will bracket the mean of all these sets of 
measurements. When only one set of measurements 
will be made, the probability is 1 —a that this interval 
will bracket the mean. 
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4. Reporting of Results 

4.1. Suppose a set of measurement data is avail- 
able, and, by using the appropriate propagation of 
error formulas, the following are obtained for the 
quantity of interest, w$: 

(1) The estimate of Wo, w, based on n values of x,y, 
etc.; 

(2) the estimated standard error of w, cr£, and asso- 
ciated degress of freedom /; 

(3) limits to the systematic error in w, Am;. 

The estimated standard error of w gives a measure 
of precision of the experimental results, or a measure 
of scatter of the values of w from the average value of 
M w for repeated performance of the particular experi- 
ment. But this measure of precision does not indicate 
at all how close this average value is to the value Wq 
intended to be measured. The estimation of limits to 
the systematic error is an essential part of an experi- 
ment and need not be discussed here [Youden, 1961]. 
One may remark generally that systematic errors 
usually do not pose a serious problem when the 
"imprecision" is large, since these systematic errors 
are, so to speak, "swallowed up" by the random errors. 
The systematic errors, however, play an important 
role when the precision is excellent and is of about the 
same order of magnitude as the systematic error. In 
that case, it is essential that the systematic error, or 
errors, be reported separately from the imprecision 
part of the reported value, as measured by the standard 
error, or the confidence intervals, computed. 

In scientific literature, it is not uncommon to come 
across expressions of results in the form of M±e, 
where "A/" is an average of some kind and "e" repre- 
sents the uncertainty of "M" in some vague sense. 
This type of reporting proves to be most frustrating 
from the reader's point of view. From the context 
alone the reader cannot possibly infer whether "e" 
represents probable error, 3-sigma limits, systematic 
error, or some combination of random and systematic 
errors. As a consequence, the quality of the results, 
and the validity of inference drawn from these results, 
are to a large extent left to the judgment and guesswork 
of the reader. Hence, the writer owes to himself, and 
to his reader, to specify clearly the meaning of "e" 
as he uses it. In particular, the number of measure- 
ments from which the measure of random error was 
computed and the manner in which the systematic 
error was estimated are both essential elements of the 
reported value and need to be included. 



A footnote explaining the role of "e" is often very 
helpful. Several examples are given below: 

"In the expression of the form M±e, M is the average and e is the 
standard error of M based on n measurements (or based on v degrees 
of freedom)." 

"The indicated uncertainty limits for M are overall limits of error 
based on ( ;f) percent confidence limits for the mean and on allow- 
ances lor effects of known sources of possible systematic error " 

"The uncertainty given represents 3-sigma limits based on the 
current accepted value of the standard deviation, known sources of 
systematic errors being negligible" 

Chapter 23 of Natrella [1963] "Expressions of the 
Uncertainties of Final Results" gives a thorough dis- 
cussion on this topic, and is an excellent reference 
for all physical scientists who have occasion to report 
numerical results of their experiments. 
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